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Abstract. We develop the theory of group actions in infinite di- 
mensional spaces and prove an abstract KAM theorem in this set- 
ting. Combining this theorem with Martinet's proof of versal de- 
formations for hypersurface singularities, we deduce a C°° Birkhoff 
normal form over a positive measure set. This answers positively 
Herman's invariant tori conjecture. 



Introduction 

Most perturbative series arising in mathematical physics diverge. 
Poincar's theorem for the case of the Delaunay-Lindstedt series in the 
restricted three body problem is among the most celebrated examples 
of this phenomenon. Following the work of Poincar, Siegel showed 
that the set of analytic functions for which these are convergent in the 
neighbourhood of a critical point is meager in Baire's sense [2H ESI ED] 
(see also [23]). 

Quite opposite in spirit to these negative results, Poincar and Siegel 
proved, in some cases, the convergence for the series which linearise 
a holomorphic vector field [26j |39]. These led to the invariant tori 
theorem of Kolmogorov and KAM theory [H [T7J [2T] . 

In the nineties, Herman asked for the existence of invariant tori in 
other situations. In 1998, during his ICM lecture, he made the follow- 
ing conjecture for discrete time hamiltonian systems [13] : 



In the neighbourhood of a diophantine elliptic fixed point, a real an- 
alytic symplectomorphism has a positive measure set of invariant tori. 

Herman conjectured also the following two variants for continuous 
time hamiltonian dynamics : 

In the neighbourhood of a torus carrying a quasi-periodic motion with 
diophantine frequency, a real analytic hamiltonian has a positive mea- 
sure set of invariant tori. 
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In the neighbourhood of an elliptic critical point with diophantine 
frequency, a real analytic hamiltonian has a positive measure set of in- 
variant tori. 

In this paper, we prove these conjectures by counterbalancing the 
negative results of Poincar and Siegel with the existence of a C°° 
Birkhoff normal form over a positive measure set, that is, a KAM 
version of the Birkhoff normal form. Our result is stronger than that 
conjectured by Herman : we will see that the diophantine condition 
can be relaxed and replaced by a condition of Bruno type. Moreover, 
the set of invariant tori is parametrised, not only by a positive measure 
set, but by a set of density equal to one at each of its point. 

Although the initial proof of Arnold is almost fifty years old, the 
proofs of the classical KAM theorem are still often cumbersome and 
many important details rely on tedious computations. To clarify the 
situation, Moser proposed a theoretical approach based on infinite di- 
mensional group actions and implicit function theorems [20] ■ After the 
work of Zehnder, this led Bost and Herman to a relatively simple proof 
of the invariant torus theorem [U H3] . 

Unlike this traditional approach, we consider direct limits of topo- 
logical vector space rather than inverse ones, LS-spaces rather than 
Frchet spaces. This restricts the field of application to the analytic 
setting. The resulting spaces do not satisfy the Baire property but this 
is of no importance for our purposes. So although, our approach looks 
from an exterior point of view very similar to the usual Frchet space 
setting, it is in fact different [HI EH Si]- Indeed, as Zehnder wrote ([HJ 
Chapter 5]), there is probably not much beyond the implicit function 
theorem in the differentiable case, while in the analytic case, there is 
a theory of group actions in infinite dimensional spaces, as suggested 
by Moser. The main point is that in the differentiable case, the use of 
implicit function theorems requires to invert the differential in a whole 
neighbourhood of the identity while in the analytic case, by avoiding 
implicit function theorems and working directly with group actions, 
we get a theorem which requires to invert the differential only at one 
point. We already observed this phenomenon with Fjoz in a simplified 
case [8]. 

Then, we go further and formulate KAM theory in this abstract 
setting. Both the classical KAM theorem and the Herman conjecture 
become particular cases of a unique generalised KAM theorem which 
occupies most of the paper. In this way, we unify KAM theory with 
standard deformation theory of maps and varieties and also with the 
theory of normal forms in semi-classical analysis [21 H21 EH] . 



THE HERMAN CONJECTURE 



3 



Now, how a generalisation of the KAM theorem might be related to 
the Birkhoff normal form might seem unclear at first glance. Indeed, 
while KAM theory deals with deformations of invariant lagrangian tori, 
there is apparently nothing that gets deformed in the computation of 
the Birkhoff normal form. In fact, there is some invariant lagrangian 
variety but it invisible from the real geometry since it has only one 
real point. Thus, after complexification, we are in a similar situation 
to Kolmogorov's version of the KAM theorem : there is an initial 
invariant lagrangian variety that we wish to deform into a family of 
invariant lagrangian varieties. 

Such situation, where a smooth fibre - in our case a torus- degener- 
ates into a singular one, is familiar to algebraic geometers although the 
case of lagrangian degenerations as only be treated more recently [53] 
(see also [TUl SI])- This interpretation of KAM theory in terms of 
deformation theory leads to a non trivial algorithm for the Birkhoff 
normal form, where non-degenerate deformations, in the sense of Kol- 
mogorov, play the role of versal deformations. So, combining KAM 
iteration with Martinet's homotopical trick leads to the construction 
of a Birkhoff normalisation over a positive measure set [19J. I could not 
prove such a result using the naive construction of the Birkhoff normal 
form. 

Whatever we chose for the normalisation process, the resulting for- 
mal power series is always the Birkhoff normal form and is therefore 
unique. In [TJ, Eliasson addressed the problem of convergence of this 
power series (see also [23]). Now, in light of our result, this happens 
to be a particular case of the more general problem, already stated by 
Sevryuk in [37] : what is the maximal Gevrey regularity that one can 
expect for the dependence of the KAM-tori with respect to parameters? 
It is rather doubtful that the answer would be that this dependence is 
analytic. 

In any case, both the KAM theorem and the Herman conjecture 
state that the existence of an invariant lagrangian variety implies the 
existence of an invariant lagrangian deformation of this variety over a 
positive measure set. One may ask if such a result holds in general, 
but this seems, for the moment, out of reach. 

Note that the deformations arising in KAM theory are not usual ones, 
in the sense of algebraic geometers. Like for the case of invariant tori 
in nearly integrable system, the family is parametrised by a completely 
disconnected set of positive measure. So, it seems that KAM theory is 
a part of a branch of mathematics which could be summarised by the 
motto : deformation theory for lagrangian varieties over disconnected 
basis. Moser's idea of reducing the usual KAM theorem to a parametric 
KAM theorem might be considered as the first avatar of it [21] (see also 
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1. Arithmetic density 



1.1. Arithmetic classes. 

Definition 1.1. A real positive increasing sequence p := (p n ) is called 
of exponential class A e M+ if there exists a positive real number B < A 
and a real constant C such that p n < Ce B " for all n's. 

The set of sequences with a given exponential class is stable under 
addition and multiplication. 

Definition 1.2. A real positive increasing sequence p := (p n ) is called 
of (C, r)-class if p n < C2 nT for all n's. 

We denote by [•] the integer part (also called the floor function) : 

x < [x] < x + 1, [x] e N. 

Remark that a (C, resequence is automatically of exponential class but 
in fact we have more : 

Lemma 1.1. For any (C, r) -sequence p, the sequence p' defined by 



is of exponential class 2 a with a :- 



M+i 



Proof. Without loss of generality we may assume that C > 1. We 
define 

t := 2 n/a . 

As (k\) n > 3- n k nk > 3~ n (nk)\, we have 

j^akr (3C ) ar ^f a ^ T 



(fc!)^ rn - (k\) ar ~ (ark)\ 
As t p /p\ < e*, we get the estimate 



(ark)] 



<e 3t . 
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This shows that 

/ < 3C(2V«)» 
fn — c ' 

and proves the lemma. □ 



This lemma is extracted from Popov's proof of the Gevrey regularity 
for invariant tori in perturbed integrable systems [27J. Although ele- 
mentary, it turns out to be essential in the proof of Gevrey regularity 
for KAM tori in the diophantine situation. 

Denote by (-, •) the euclidean scalar product in M d . For any vector 
a G M. d , we define the sequence o~(a) by : 

a(a) k := min{|(a,z)| : i E Z d \ {0}, ||i|| < 2 fe }. 

A vector is called (C, r)- diophantine if the sequence a(a)~ l of (C, r)- 
class. 

Siegel proved that a holomorphic vector field whose linear part has 
eigenvalues defining a (C, r)-vector is linearisable and Bruno extended 
this result to the vectors a which satisfy the condition [5j [39] : 

^ logor(a)„ 
— > < +oo. 

L / 2 n 

n>0 

This class contains those for which the sequence a[a)~ x is of exponen- 
tial type less than 2 which will be our main assumptions on frequencies. 
This condition is stronger than that of Bruno, but it is plausible that 
the result of this paper can be extended to this more general case. 

In KAM theory, one considers the linearisation of vector fields de- 
pending on parameters. The subsets of parameters that we consider 
will be of the following type : 

Definition 1.3. The arithmetic class in M. d associated to a real de- 
creasing sequence a = (a*) is the set 

6(a) := {aeW 1 : a{a) k > a k , Vfc}. 

Remark that arithmetic classes are closed subsets. We are in a situ- 
ation different from the KAM setting formulated by Arnold who con- 
sidered the union of some special kind of arithmetic classes pQ : 

Q T := {«6R": 3C, a{a) k > C2-' n \ \/k}. 

The subsets Q T are of full measure and this property is invariant 
under base changes which satisfy the Pyartli-Rssmann condition [281 
132] . In our situation, the set C(a) is, in general, not of full measure 
and might even contain isolated half-lines. 
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1.2. Statement of the theorem. We denote by U C M. d an open 
neighbourhood of the origin. 

Definition 1.4 ([Poj |To] |2%]). A C l -map 

f '■ U — > R n , x = (x u ...,x d )^ (fi(x), f n (x)) 

is called l-curved in a vector subspace V C MJ 1 at a point x G R d if : 

i) f(U) = f(x) + V ; 

ii) the partial derivatives {<9 J f(x) : \j\ < 1} generate the vector-space 

V where j = (ji , j 2 • • • , 3d) is a multi-index and \j \ — j\ + J2 H h 

3 d- 

Such mappings are also called non- degenerate. Sometimes we omit 
to specify V or /. 

Let us recall the definition of density, in the measure theoretical 
sense. For a G M. n , we denote by B(a,r) the ball centred at a with 
radius r. The density of a measurable subset K C MJ 1 at a point a is 
the limit (if it exists) : 

Vo\(KnB(a,r)) 
hm — . 

r->o Vol(B(a,r)) 

The density of a measurable subset is equal to 1 at almost all of its 
point [IS]. For instance, sets of zero Lebesgue measure have density 
equal to one at almost all points and, in fact, equal to zero at all points. 

Theorem 1.1 QllJ). Consider a real positive decreasing sequence a = 
(afc) and define the sequence a' = (a' fc ) by 

I _ -fen-fc/(n+l)-(n+l) 2 M^(«+l)' 

a k .— z a k 
For any l-curved mapping 

f = (fi, ■ ■ ■ , fn) : U — > R n , f(0) = ae e(o) 
the density of the set f~ 1 (G(a')) at the origin is equal to 1. 

2. The C°° Birkhoff normal form 

2.1. The frequency space. Let us consider the vector space M. 2n with 
coordinates (&,Pi, i = 1, . . . , n and the standard symplectic form : 



u : = 



} J dqi A dpi 



Recall that a C°° diffeomorphism is called a symplectomorphism if it 
preserves the symplectic form. 

If /, g are series in the q, p variables, we use the notations 

f = 9 + o(l) 
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whenever the terms arising in series f—g are of degree at least / + We 
denote by f( k ' the homogeneous component of degree k in /. We say 
that / is of order I if / = o(l — 1). For a multi-index I = (i 1; i 2 , . . . , J n ) 
we use the notation 

X 1 = Xl 1 . . . X 1 ™ and \I\ = h + I 2 + --- + I n . 

Let H : (R 2ri , 0) — )■ (R, 0) be an analytic function germ with a pos- 
itive definite hessian. In such a situation, the critical point is called 
elliptic. Up to a symplectic linear change of coordinates, we may as- 
sume that H is of the form 

n 

IT = $>(p? + g?) + o(2) 
i=i 

The vector 

a = (an,..., ot n ) G R n 
is called the frequency vector, or simply the frequency, of if at the 
critical point. 

Proposition 2.1. Lei H : (R 2n ,0) — )• (R, 0) fre an analytic function 
germ with an elliptic critical point at the origin. If the coordinates 
of the frequency vector are linearly independent over Q then, for any 
k > 0, there exists a symplectomorphism ip^ '■ (R 2n ,0) — > (R 2 ™, 0) and 
homogeneous polynomials Df = J2\i\=j a i^ 1 °f degree j < k such that 

k 

(p k (H) = Df{qiPi, q n Pn) + o(2k) 

3=1 

with 

Df(q lPl , q nVn ) := ^ ai {p\ + qfY^pj + qj) h . . . (p 2 n + g 2 ) 7 ". 

\I\=3 

Moreover, the polynomials do not depend on the choice of Lp k . 

This proposition was attributed by Poincar to Delaunay and Lind- 
stedt, it is sometimes also attributed to Birkhoff [3J |25]. We will give 
a proof using a fast iteration in 13.11 

The polynomials Dj? will be called the Delaunay-Lindstedt polyno- 
mials. In particular, there is a uniquely defined formal power series 

Z H = J2 D ? e^[[Ai,---,A n ]], 

3>1 

called the Birkhoff normal form of H, such that B H (qipi, . . . , q n p n ) is 
conjugated to if by a formal symplectomorphism. 

The Delaunay-Lindstedt polynomials of if define polynomial map- 
pings 

VL>f = (d Al Df , . . . , d Xn D» ) : R» — ► R« 
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Definition 2.1. The frequency space of H , denoted 3(H) C IR n ; is the 
smallest vector subspace of W 1 which contains the images of the V 
for any k > 2. 

From a scheme-theoretical point of view, the gradient of B H induces 
a map of formal schemes 

Spec (C[[Ai, . . . , \ n ]] n ) — ► Spec (C[[*i, . . . , t n ]] n ) 

induced by substituting tj with d\H H — oii and 3(H) is smallest vector 
space which contains the image of this map. 

If the quadratic form is non degenerate then 3(H) = M. n . These 
will be called the nondegenerated cases in the sense of Kolmogorov. 
Remark that, by definition, the frequency space of a hamiltonian is 
invariant under the action of symplectomorphisms : 

3(H oip) = 3(H) 

for any symplectomorphism tp : (C 2ra ,0) — > (C 2n ,0). 

2.2. Statement of the theorems. Let us now extend the notion of 
symplectomorphism to closed subsets. A C k -symplectomorphism (k > 
1 or k = oo), between closed subsets X,X' C M 2n : 

if : X — >■ X', <p(X) = X' 

is the restriction to X of a C fc -diffeomorphism $ defined on an open 
subset U containing X such that 

(®*u) x = co, Vx G X'. 

More generally, a Poisson morphism from a subset of a Poisson manifold 
is the restriction of a C fc -mapping which preserves the Poisson structure 
over the subset. 

Consider the map 

n . R 2n _^ Rn> ^ {p 2 + g 2 p 2 + £ ^ + g 2 } _ 

This paper will mainly be concerned in proving the 

Theorem 2.1. Let a = (aj) be a decreasing sequence such that or 1 is 
of exponential class 2. Consider an analytic function germ 

H : (R 2n ,0) — ► (R,0) 

having an elliptic critical point at the origin with frequency 

a = (cti, . . . , a n ) G Q(a). 

There exists a C°° map germ B H : (R n , 0) — > (R, 0) and a C°° sym- 
plectomophism germ : 

tp:(X,0)^(X',0), 

with 

X := 7T- 1 o (VB H )- 1 (e(a)), X' = <p(X) 
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such that 

i) Ho ( p = B H {p 2 + q 2 ,...,p 2 n + q 2 ) ; 

ii) the Taylor series of B H at the origin is equal to the Birkhoff normal 
form of H ; 

iii) the restriction of (p to the fibres of the map it is analytic ; 

iv) the mapping VB H is curved in J(H) . 

Moreover if a is of (C, r)-class, for some C,t > 0, then if and B can 
be chosen of Gevrey class [r] +2. 

(Recall that a C°° function / is of Gevrey class j over a closed subset 
K c R d if 

\dPf{z)\ 
SUP (\8\W < + °°' 

With |0| = &+& + ••• + &.) 

In case the frequency space is a point, then the set X is an open 
neighbourhood of the origin and the theorem asserts that H is inte- 
grable in this neighbourhood. This generalises a theorem proved by 
Rssmann in the diophantine case [3 1J . In the other extreme, if the 
frequency space equals R n , i.e., if H is non-degenerate, in the sense of 
Kolomogorov, then the theorem becomes a local variant of the KAM 
theorem. 

A similar theorem holds for an arbitrary quadratic part (not neces- 
sarily elliptic) and for the complex holomorphic situation. 

Denote by T n the real n-dimensional torus, that we identify with the 
zero section of its cotangent space 

T*T n «T"xl" = {( Ql , ...,q n , Pl ,.. .,p n )}, ft G (M/2ttZ) )K Gl. 

We equip this space with the standard symplectic form 



22 dqi A dpi. 



u : = 

i=\ 

To avoid introducing new notations, we denote by 7r the map 
vr : T n x W — > R n , (q,p) ^ ( Pl ,p 2 , . . .,p n ). 

Recall that a hamiltonian function defines a quasi-periodic motion 
with frequency a = (a±, . . . , a n ) G M n along T n if the restriction of its 
hamiltonian vector field to T n is linear of the form Y^7=i a $qi- m such 
a case, up to a constant, H is of the form 

n 

2j OLi Pi + 2J Otij PiPj 
1=1 

where the a^- are analytic functions of the ftp's in a common neigh- 
bourhood of T n . 
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Theorem 2.2. Let a = (a,) be a decreasing sequence such that a 1 is 
of exponential class 2. Consider an analytic function germ 

H : (T*T n , T n ) — > (R,0) 

defining a quasi-periodic motion along T n with frequency 

a = (ai, . . . , a n ) G C(a). 

There exists a C°° map germ B H : (IR n , 0) — > (R, 0) and a C°° sym- 
plectomophism germ : 

(p: (X,0)^(X',0), 

with 

X := 7T- 1 o (V^)^(e(a)), X' = <p(X) 

such that 

i) H o(p = B H (p u ...,p n ) ; 

ii) the Taylor series of B H is equal to the Birkhoff normal form of H 

iii) the restriction of tp to the fibres of the map it is analytic ; 

iv) the mapping VB H is curved in 3(H) . 

Moreover if a is of (C, r)-class, for some C,t > 0, then if and B H can 
be chosen of Gevrey class [r] +2. 

The theorems stated above concern neighbourhoods of invariant tori 
of dimension and n, there are similar theorems for invariant isotropic 
submanifolds of arbitrary dimension. All cases can proved along the 
same lines, so we will later give the proof only for Theorem 12. 11 For the 
moment, let us observe that these results imply the following strength- 
ening of Herman's conjecture. 

Theorem 2.3. Let H : (R 2n , 0) — > (M, 0) be an analytic function germ 
having an elliptic critical point at the origin with frequency a. If the 
sequence a(a)~ l is of exponential class 2 then H admits a representative 
whose invariant tori form a set of density one at each point. 

Proof of Theorem \2.3[ We apply Theorem 12.11 to the sequence 

/ ._ r)-kn-kl(n+l)~(n+l) 2 kdl(n+l)l 

a k .— z a k ■ 

By assumption V-B^(O) = a G C(a) C G(a'), thus Theorem 11.11 implies 
that the set S := (V-B i/ )~ 1 (C(a')) has density one at the origin. It 
follows that the preimage X of S under the map 

n . M 2„ _^ R n (?) p )^(ql+pl,...,fi + pi) 

and its image X' = f(X) under (p have both density one at the ori- 
gin. More generally, by a similar argument, we get that in the neigh- 
bourhood of an invariant torus of arbitrary dimension, there is a set 
of density one consisting of n-dimensional invariant tori. This proves 
Theorem 12.31 □ 
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2.3. KAM theorem without non degeneracy assumptions. As 

we shall see, there is no difficulty in introducing parameters in Theo- 
rem 12.11 and 12.21 Let us for instance consider the case of Theorem 12. 2[ 

Denote by M the product of the real line with the cotangent space 
to the n-dimensional torus with variables qj G (R/27rZ), Pj G R , for 
j = l,...,n: 

M:=Kx T*T n = {(t,q,p)} 

Consider the Poisson structure induced by that of the cotangent bundle 
to the torus : 

n 

We identify T n with the cartesian product of the zero section in the 
cotangent space with {t = 0} and denote by ir the map 

7T : M > R n+1 , (t,q,p)^(t,p). 

We use the notation V := (d Pl , . . . , d Pn ) for the gradient in the p vari- 
ables. 

Theorem 2.4. Let a = (aj) be a decreasing sequence such that a^ 1 is 
of exponential class 2. Consider an analytic function germ 

H : (M, T n ) — ► R 

such that H(t = 0, — ) defines a quasi-periodic motion with frequency 

a = (an, ■ ■ ■ , 0! n ) G 6(a) 

along T n . There exists a C°° map germ 

B H : (R x M n ,0) — > R, (t,p) \-> B H (t,p) 

and a C°° symplectomorphism germ : 

ip: (X,0)^(X',0), 

with 

X := 7T- 1 o (VB H )-\e(a)), X' = ip(X) 

such that 

i) H o tp = B H (t,p u ...,p n ) ; 

ii) the restriction of <p to the fibres of ir is analytic ; 

iii) the mapping VB H is curved in J(H) . 

Moreover if a is of (C, r)-class, for some C,r > 0, then if and B H can 
be chosen of Gevrey class [r] +2. 

This gives a variant of Kolmogorov's original theorem : the existence 
of a single invariant tori at t = with diophantine quasi-periodic mo- 
tion implies the existence of a positive measure set of such tori in the 
product space R 2ri x R, without any non- degeneracy assumption. 
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The reader might compare this result with a theorem of Katok [T%1 
Theorem A], with Rssman's version of the KAM theorem [32] and with 
the example given by Sevryuk in [37J Section 2]. 

3. The iterative process for the Birkhoff normal form 

3.1. Formal Birkhoff normal form by a fast iteration. We shall 
now see that the formal Birkhoff normal form can be obtained by a 
fast iteration. This iteration plays an intermediate role between the 
usual one and the one that we will use to prove the existence of a C°° 
Birkhoff normal form (Theorem 12. ip . 

The real structures will play essentially no role in the sequel. Thus, 
we consider the vector space C 2 " with coordinates qi,Pi, i = 1, . . . ,n 
and the complex holomorphic symplectic form : 

n 

u := dqi A dpi. 
i=l 

Denote by C[[g,p]] the ring of formal power series in the variables 
q = (qi,q 2 , ...,q n ), p = (pi,p 2 , • • • ,Pn)- 

The symplectic form induces a Poisson structure on this ring defined 
by 

n 

{f, 9} ■■= J2 d iJ d P>9 - d <n9 d P J- 

i=l 

Let H G C[[g,p]] be a series with a non-degenerate hessian. Up to a 
symplectic linear change of coordinates, we may assume that H is of 
the form 

n 

^aiPiqi + o(2). 
i=l 

Like in the real case, the vector 

a = («!, ...,a n ) E C n 

is called the frequency vector (or simply the frequency) of H at the 
critical point. Put H := and assume that the coordinates of the 
frequency vector are linearly independent over Q. In this case, the 
kernel of the linear mapping 

C[[q,p]] — ► C[[q,p]}, f^{f,H } 

is the algebra A of formal power series in 

Piqi, P2<?2, • • • , p n q n - 

Note that projecting A to the cokernel of the map {— ,H } gives an 
isomorphism, thus we have an exact sequence of algebras 

_> A — > C[[q,p}} { ^* } C[[q,p}} ^A^O. 
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We now define the iteration. At the first step, we have 

n 

H := H^ = "iPifc, Ro-=H- H . 
i=i 

Once H k , R k are known the next term is constructed as follows. 

There exists a (non-unique) polynomial Pk G C[g,p] of order 2 k + 2 
and a unique B k G A such that 

i?f +2) + ---+i?r i+1) = m ) ^}+^. 

We consider the symplectic automorphism e~{ Pk ~} of the ring C[[q, p}} : 
f^f- {P k , /} + |{P fe , {P k , /}} - ^{P fe , {P fc , {P fe , /}}} + . . . 

(This automorphism can be interpreted as the hamiltonian flow of —P k 
at time t = 1.) 

The iteration is defined by 

JPfc+i '■= H k + B k , 

\R k+1 := e-^-\H k + R k ) - H k+1 . 

Thus the polynomial B k can be expressed in terms of Delaunay-Lindstedt 
polynomials : 

2 fe 

B k (q u ...,q n ,p u ...,p n ) = D ?(Pi<liT--iPnqn), 

j=2 k - 1 +l 

for k > and P = 0. 

Let us spell out the first steps of this induction. For k = 0, we chose 
a homogeneous polynomial Po of degree 3 such that 

R^ := {^0, Po}- 
In that case Bq = and we have 

e-{ p °'->(P + Po) = #i + Pi 

with Hi = H Q . Remark that, while P was of order 3, the series Pi is 
now of order 4 : 

Ri = e -{Po,-}( Ho + _ Ho = Ho + Rg _ (p 0) Hq} _ Hq + o(4 ) = o(4 ) 

To perform the next step, choose polynomials Pi G C[[g,p]], Pi G A of 
order 4 such that 

Pj 4) + PS 5) = {Pi,Pi} +Pl 

Write 

H 2 := Pi + Pi, P 2 := e-{ Pl '->(P 1 + Pi) - P 2 

and repeat the procedure. Note that, while Pi was of order 4, the 
series P 2 is now of order 6 and not only of order 5 : 

P 2 = e-{ Pl <-}(Pi+Pi)-P 2 = Pi+Pi-{Pi,Pi}-Pi-Pi+o(6) = o(6). 
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In this way, we construct step by step a symplectic automorphism 
( p k = e-t p >>- 1 >-*...e-V*>-> 

such that 

H o tp k = H k + R k 

with 

H k = ^Dffaqt, . . .,p n q n ) G A, R k = o{2 k + 1). 
3=1 

As the Poisson bracket decreases the degree by 2 and Pk is of order 
2 k + 2, in the expansion of the exponential 

H k - {P k , H k } + ^{P k , {P k , H k }} + ... 

the quadratic term {P k , {P k , H k }} might contain terms of degree 

2(2 fc + 2) + 2 - 4 = 2 k+1 + 2. 

Thus, as long as we compute with formal power series, we cannot do 
better than solving the linearised equations up to order 2 fe+1 + 1. How- 
ever, in the sequel, we will be interested by estimates for some supre- 
mum norms, thus we will increase the order of approximation accord- 
ingly 

The real case offers no difficulty If 

t : C d — >C d 

denotes an anti-holomorphic involution. We say that a formal power 
series f(z) is r-real if f{rz) = f(z). If H is r-real then the Pj's can also 
be chosen of this sort and the corresponding symplectic automorphism 
preserves the real structure. 

3.2. The invariant lagrangian variety. According to Siegel, for a 
generic H any symplectic automorphism which conjugates H to its 
Birkhoff normal form is divergent [381 SO] ( see a l so [23J). So, we modify 
the normal form constructed by the previous Ansatz in two steps. 

Denote by 0o the sheaf of holomorphic functions in C n . The stalk 
of this sheaf at the origin is a local ring and we denote by Mjj.„ the 
powers of its maximal ideal : 

M P ,o := {/ e 0e>,o : /(0) = 0}. 
Theorem 3.1. Consider the quadratic polynomial 

n 

H(q,p) = ^(XiPiqi G C[q,p] 
i=i 



THE HERMAN CONJECTURE 15 

If the frequency vector a = («i, . . . , ot n ) is such that a(a)~ l is of expo- 
nential class 2 then for any holomorphic function germ of the type 

H + R, R G M^2n o 

there exists a symplectic automorphism <p G Aut (Oc 2n ,o) su °h that 

<p(H + R) = H (mod/ 2 ) 

where I C Oc 2 ™,o is the ideal generated by piqi, P2<?2, • • • , p n q n - 

Note that the construction of the Birkhoff normal form implies the 
existence of a symplectomorphism ip^ such that 

<p N (H + R) = H + o(N) (mod J 2 ), 

for any N > 0. 

Theorem 13.11 implies that, in a sufficiently small neighbourhood of 
the origin, the representatives of the germ H + R admit an invariant 
complex lagrangian variety locally symplectomorphic to 

{{q,p) G C 2ri : pxqx = p 2 q 2 = ■■■ = p n q n = 0}. 

This results shows the relation between the statements a hamiltonian 
system has a critical point and a a hamiltonian system has an invariant 
torus : in both cases, it has an invariant complex lagrangian variety. 

3.3. Algorithm for proving Theorem 13.11 In order to prove the 
theorem, we introduce a second graduation of C[[g,p]] defined by 

hdeg(p i q!) = [\\i-j\\], i,j G N n C W n 

where [— ] denotes the entire part. We will refer to harmonic- degree for 
the degree with respect to this graduation and use the word degree for 
the usual polynomial degree. We denote by 5F' the following associated 
filtration : 

\\i-j\\>2 k 

Let Vfc C M^n o be the subspace of formal power series with harmonic- 
degree at most 2 k : 

Vk-={ ai i qipj : ai i G C ) n M c2",o- 

||i-j||<2 fc 

Put H = H and R = R G M^ 2 „ . 

Our algorithm depends on the choice of a natural number N > 
which fixes the order of approximation for the first steps. At n — 0, we 
choose holomorphic function germs P G Vo, B G 1$, such that 

{P ,H } + B = R Q mod^ N . 

Define 

H x := + B , Ri := e^ p °'-\H + R ) - Hi 
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To repeat this process, we first define the function 

: N — > N 

by 

, fiV for n < N 
*W = \n for n>iV 

At the step, we write 

Hk = Hk-\ + B k -i, B k -i G I 2 

and solve the infinitesimal equation : 

{P fe , + B k = R k mod 3^ 

with Pk G V^(fe), -Bfc G I 2 fl V^(fc). Then, like in the formal case, we 
define 

.iWi := e-^>->(fT fc + i2fc) - if fc+1 . 

We construct symplectomorphisms 

Vk : = e -{^-} e -{^-i->...e-^-> 

which "almost" conjugate H + R to H modulo J 2 for both nitrations. 
In particular, the degrees in the q,p- variables increase. 

An important ingredient concerning this iteration is that we can 
compute explicitly a solution to the infinitesimal equation, as we shall 
now see. 

3.4. Solving the infinitesimal equation. Observe that if F is a 
vector subspace of a vector space E then the restriction of the canonical 
projection to a supplementary subspace E 1 C E gives an isomorphism 

E/F « £". 

We put I k — I fl V k , I k — I 2 f]V k and we apply this principle to the 
exact sequence of vector spaces : 

o — ► 4// 2 — ► \4// 2 — ► \4/4 — ► o. 

The quotient space V k /I k gets identified with the sum of the subspace 
Ak generated by the monomials 

with iiji = ■ ■ ■ = i n j n = and \\i — j\\ < 2 fc 

which belong to the ideal M^ 2 n with the vector space 

n 
i=l 

Under this identification the isomorphism given by splitting the exact 
sequence 

V k /I 2 » Vk/h © hill 
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consists in taking the sum : 

A k © "B k — > Vk/Il, (a, b) ^ a + b 

Now, at the k th step, we wish to find an inverse of the map 
p k :Vk/I 2 k^Vk/I 2 k , f^{f,H k }. 

To do this, first recall that the Hadamard product of two series 

i£N n ieN n 

is defined by 

f*9-= Y aihiZt - 

The map induced by pk on Vk/Ik ~ Ak is simply the Hadamard 
product with the power series 

/ : = J2(a,{i- j))p i q j , 

in particular 

{-,H k } = {-,H } mod/. 

Thus, in Ak, a right-inverse is given by the Hadamard product with 
the holomorphic function germ 

g k = Yl K(*-j)rW- 

0<IK-j||<2 fc 

We now lift this right-inverse from Vk/h ~ Ak to Vk/1% ~ Ak © 23fc. 
Via the identification of Vk/I% with ,A fc © ¥> k , the map p fc admits the 
lower triangular decomposition 

( f* A 

Pfc = r y^A: D -, , , : A k © Sfc — >■ A k © Sfc. 

Thus a right-inverse map is given by 

. . = ( 9k* \ 

3k ' \-9k*{-,Y,i=i B i}9k* 9k* J 

that is : 

k 

jk : Ak®T>k — yAk^^k ■ (a, 6) i— >■ (gk*a,gk*b-g k *{gk*a, ^ 

i=l 

In this way, we solved explicitly our equations at each step. 
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3.5. Lagrangian normal forms. We construct the C°°-Birkhoff nor- 
mal form by extending the construction of Theorem 13.11 to the case 
where the functions depend on parameters. This involves a generalisa- 
tion of the usual Birkhoff normal form. 

Consider the algebra of formal power series C[[A,g,p]] with A = 
(Ai, . . . , A n ). We extend the graduation by the polynomial degree by 
putting the degree of the Aj's equal to 2. 

The bivector 

n 
i=l 

induces a C[[A]]-linear Poisson structure on C[[A, q,p))- 

Denote by / C C[[A,g,p]] the ideal generated by the — Aj's for 
i — 1, . . . ,n. If two series are equal modulo the square of the ideal 
/ then they define the same hamiltonian derivation of C[[A, q,p]]/I. 
Indeed for any g,h <E I and any H G C[[A, q,p]}, we have : 

{H + gh, -} = {H,-} + g{h, -} + h{g, -} = {H, -} (mod I). 

In a more abstract way, the ideal / defines a Lagrangian deformation 
thus the conormal module I /I 2 gets identified with the tangent module 
of the deformation. 

The inclusion C[[g,p]] C C[[A,g,p]] induces an isomorphism of Pois- 
son algebras 

C[[q,p}]^C[[\,q,p}}/I. 

Thus any hamiltonian derivation in C[[g,p]] can be identified with a 
hamiltonian derivation in C[[A, q,p]]/I. 

To see how the Birkhoff normal form can be written via this iso- 
morphism, put qp = (qxPi, . . . , q n p n )- Consider a formal power series 
H G C[[g,p]] such that the components of the frequency vector are lin- 
early independent over Q. In that case, H can be reduced to a Birkhoff 
normal form B H via a formal symplectomorphism. 

The Taylor formula at order one gives : 

n 

B H (qp) = B H (X) +J2dx i B H (X)(q i p i - A^mod J 2 ). 

Therefore the series B H (qp) and Yl^i^XtB 11 (X)(qiPi — Aj) define the 
same hamiltonian derivation of C[[A, q,p]]/I. Thus, the usual Birkhoff 
normal form and the right-hand side of this equality contains the same 
information concerning the hamiltonian derivation {— ,H} in ring of 
formal power series. 

More generally, we may take for / the ideal generated by the piqi — 
gi(\) where g = (gi, . . . , g n ). These lagrangian normal forms are more 
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general than Birkhoff normal forms since the frequency mapping does 
not need to be a gradient, namely : 

Proposition 3.1. Consider a holomorphic function germ 
H : (C k x C 2 " , 0) — ► (C, 0), (A, q,p) M- H(X, q, p) 

such that 

n 

H(X = 0,q,p) = ^ ctiViqi + o(2) 
i=i 

If the ojj 's are linearly independent over Q £/ien ; /or any > ; there 
exists the germ of a Poisson mapping ip^ : (C fc x C 2n , 0) — > (C fe x 
C 2n , 0) and polynomials gi^, ■ ■ ■ , Qn,fc £ C[A] o/ degree k — 1 swca iaai 

n 

=X)ft ) fc(A)p i g f + o(2A;) (mod/ 2 ©C[A]) 
i=i 

Moreover, the polynomials g% y ki ■ ■ ■ ,9n,k do not depend on the choice of 

The notion of frequency space and Kolmogorov's non-degeneracy 
condition extend immediately to this setting. 

3.6. Martinet's trick in the KAM context. The nave algorithm 
of the fast Birkhoff normal form (see 13 . 1 j) would require to construct 
the set of invariant tori step by step, like in Arnold's proof of the KAM 
theorem. We will proceed differently : using the lagrangian normal 
form, we interpolate our initial hamiltonian with a non-degenerate one, 
in the sense of Kolmogorov. This produces a homotopy between our 
functions. Our aim is to prove that this homotopy is given by a change 
of variables which preserves the Poisson structure. This artefact was 
invented by Martinet in the context of Thorn-Mather theory |19j . 

Consider the algebra C[[A, fi, q,p}} of formal power series in the 4n 
variables Ai, . . . , A n , /ii, . . . , /i n , q\, . . . , q n , Pi, ■ ■ ■ ,p n - We extend the 
graduation by the polynomial degree in C[[g,p]] by assigning the degree 
2 to the Aj's and the /Vs : 

deg(Xi) = deg(fii) = 2, Vz = 1, . . . , n. 

The bivector 

n 
i=l 

induces a C[[A, /x]]-linear Poisson structure on C[[A, //, q,p]\- 

Denote by / C C[[A, //, q, p}} the ideal generated by the piq. t — A« — //j's 
for i = 1, . . . , n. 
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Let us now consider a power series of the type : 



n 



H(q,p) = ^aiPiqi + R , R := D"(qm, ■ ■ .,q„p n ) + o(4), 



i=i 



where the ctj's are linearly independent over Q. Here a = (a±, . . . , a n ) 
is the frequency at the origin and is the second Delaunay-Lindstedt 
polynomial of H . 

The polynomial is homogeneous of degree 2, so that the poly- 
nomial D^iqiPi, • • • , q n Pn) is homogeneous of degree 4, and we denote 
by Q the associated symmetric n x n-matrix. Next, define the matrix 
A = Id — 2Q and consider the polynomial 



Observe that the restriction of Go + -Ro to /x = is equal to H and, 
modulo I 2 , G + is equal to 



reduces G + R to G . 

Assume that, for any k > 0, there exists a Poisson automorphism ip k 
such that 



This Poisson automorphism </? fe induces a biholomorphic map, that we 
denote in the same way, between neighbourhoods of the origin in C n x 



Let U k be a neighbourhood of the origin in C n x C n over which is 
biholomorphic. The image by the automorphism (p k of the projection 
map 

(A,/i) H> /i 

defines a map germ in the variables A ;/ u. The submanifold 

M k :={(\,fi)EU k : A k (\,fi)=0}. 

is the image of the vector space /i = under As ip k is tangent to 
the identity, it is the graph of a function g k : 





In this way, the linear change of variables 



<^(G + i? ) = G + o(2fc) (mod/ 2 © C{A}) 



C« = {(A,/i)}. 



M fe := {(A,/i) e C4 : /i = ^(A)}. 



THE HERMAN CONJECTURE 



21 



As the restriction of G + Rq to fi = is equal to H , we get that : 

n 

(fik(H) = (G + o{2k))\ Mk = ^2gi, k (^)Piqi + o(2k) 

i=l 

where = stands for an equality modulo I 2 ® C{A} and g k = (gi,k, ■ ■ ■ , <?n,fc)- 
The unicity of the Birkhoff normal form implies that the first terms in 
the Taylor expansion of gk are given by the gradient of the Delaunay- 
Lindstedt polynomials : 

k 

g k = J2 VD ? + °( 2k )- 

Using the relation between Birkhoff normal forms and lagrangian ones, 
we get that : 

k 

<p k (H) = Df(qiPi, q n Pn) + o(2k). 

i=0 

This shows that after introducing the map Go an d constructing the 
Poisson automorphisms ip k , we can recover the usual Birkhoff normal 
form of our initial hamiltonian H. We now proceed to the linearised 
problem involved in the construction of the ipkS. 

3.7. The linearised equations for the lagrangian normal form. 

Consider the algebras 

£ := C[[A,/i,g,p]]// 2 , S := C[[A,/i]], := Der c[[A]] (S) = ©? =1 C[[A,/i]]9 w . 

The nave filtration by the degree does not satisfy the properties that 
we shall require for proving the convergence of our iteration. Thus, we 
extend the harmonic graduation which associates the degree [||i — 
to p l qi to C[[A, /i, q,p}} by putting : 

hdeg(Xi) = hdeg(jii) = 0, Mi = 1, . . . , n. 

So that now 

? k = { E ^v, ^ e s >- 

||i-j||>2 fe 

Denote by V k the space of formal power series with harmonic-degree 
at most 2 h : 



v k'-= { ^2 : aij e §}• 



IK-ill<2 fc 

Put 

h = inv k , i 2 = i 2 nv k , F k = i 2 ®§. 

The Poisson bracket induces a map 

A fe :\4/4 2 ^Der s (£), / ^ {/, -}. 
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We define 

So = 9, Q k ■= Im At, fc > 0. 
and construct an inverse to the map 

pk(C) : g k — ► V k /F k , v ^ v(G + C) 

for any C G if of order 5. 

We identify the quotient space V k /F k with the sum of the S-sub modules 
A, H> k , Cfc where A and T> k are respectively generated by the following 
monomials : 

(1) Ptft's ; 

(2) pV s with = ■ ■ ■ = i n j n = and < ||z — j\\ < 2 h ; 

and 

n 

G k = 0PigiS fc . 

i=i 

Let us first consider the case k — 0. The S-linear map 

p (C) : e — ► .a 

is given by an n x n matrix of the type 

Id + R c . 

I assert that the matrix Rc vanishes for g = 0, p — 0, A = = 0. As 
C belongs to J 2 , it is of the form : 

n 

C =^ a ij(PiQi VdiPjQj ~ - /ij), <Hj 6 C[[A,//]], 

an since C is of order 5, the vanish at the origin. Moreover, 

n 

d M C = - (ay + a^faqi -h-y-i)- 

Thus, the matrix i?c* has coefficients — (a^ + Ojj) and the a^-'s vanish 
at the origin ; this proves the assertion. 

For the case k > 0, the situation is similar to that of 13.41 In the 
invariant space A © S fc © C fc , the operator p k {C) admits the lower 
triangular decomposition 

f Po (C) \ 
p k (C) = /* 

V o {- c} /*/ 

with 

#3 
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Define the function 

g k (X,fi,q,p) := ^ (a + //, (i - j)) _1 pV . 

0<\\i~j\\<2 k 

The map 

fjo(C) 

3k{C) := g k * 

\ -#&*{-, C}^* g k -k / 

is a right-inverse to p k (C). 

3.8. The iterative procedure. We choose N > 0, the convergence of 
algorithm that we describe holds only for N big enough. Next, define 
the function <f> as before : 

, fJV for n<N 
^ = \n for n>iV 

and decompose Ro as 

i? = So + T with S G V m , T G ^ (0) . 

Consider the derivation 

u := j 4,(0) (0) (S ), B := S - u (G ). 

Put Gi = Go + -Bo and define i?i by 

e- U0 (G + R ) = G 1 + R 1 . 

At the next step, we write 

R! = S! + Ti with Si G ^(i), T x G 3^ (1) 

and put 

«i := j Hl) (B )(S 1 ), B 1 := S 1 - u 1 (G 1 ). 
In this way, we construct stepwise sequences 

K), u k G g, (B k ), B k G I 2 k © C[[A, /x]] 
with G k+ i = G + B + --- + B k , 

e-^{G k + R k )=G k+1 + R k+l 

and 

B k+ i = S k — u k+ i(G k+ i) 

where S k is the sum of terms of harmonic-degree at most 2^ appearing 
in the expansion of R k . 

The automorphism 

<p k ■= e -"* e -«fc-i . . .e~ U0 

sends G + -Ro to G k+ \ + -Rfc+i- Note that all the GVs define the same 
hamiltonian derivation of the algebra Ji. 
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3.9. Convergence of the Ansatz. We use the notations of 13.61 and 
denote by C{z} C C[[z]} the germs at the origin of holomorphic func- 
tions in the variable z = (zi, . . . , z\). Let us now assume that our initial 
hamiltonian function H satisfies the assumptions of Theorem 12.11 

In the rest of the paper, we will prove that, in the C°°-topology of 
Whitney differentiable functions, the process described in [32] converges 
for fi G —a + G(a) and X,q,pG C n sufficiently close to the origin, and 
N big enough. Thus there exists a Poisson morphism 

V (G + Ro) = G . 

where = means an equality modulo an element of J 2 and a Whitney 
differentiable function depending only on A, /x. 

As (ifk) is convergent, the sequence (Ak) also converges to a limit A. 
Let U be a sufficiently small neighbourhood of the origin in C n x C n 
and let 

M := {(A, fi) G U n (C n x C(o)) : A(X, n) = 0}. 

The projection of M to C n defines a subset K over which the sequence 
(gk) converges to a limit g and M is the graph of g. 

By general arguments, we will deduce that the sequence (VD^) is 
summable in L 2 (K) (Proposition 15. 3j) and 

fc>0 

This implies that M is contained inside C n x 3{H). Using the Whitney 
extension theorem, we extend the function 

9 '■ K — > 2(H) 

to an open neighbourhood in the space of the variable A. In this way, 
we get an Z-curved map in 2(H) and a C°° lagrangian normal form. 

To recover the C°° Birkhoff normal form from the lagrangian one, 
we show that g is the gradient of J2k>o^k^ i- e - : 

£VDf = V$>f. 

fc>0 fc>0 

This is a consequence of the closed graph theorem. Indeed, write 

j=k j=k j=k 

By the mean-value theorem, the right hand-side is estimated by the 
gradient of Yl]=k Df , thus it tends uniformly to zero. Consequently 
the sequence (Y^=i Df , Ylj=i ^Df) converges towards a limit and by 
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the closed graph theorem 

j>l \3>l / 

This will conclude the proof of Theorem 12.11 

Our task will consists in a systematic study of the convergence in 
KAM-type iterations. We first consider the problem of small denomi- 
nators and construct a formalism similar to that introduced by Hamil- 
ton, Sergeraert and Zehnder for Frechet spaces for the differentiable 
case but with a difference which turns out to be fundamental : in our 
case, the scaled object is not a Frchet space but rather an LB-space. 
As we will see, the theory becomes more rigid but also richer, just as 
complex geometry with respect to differential geometry. 

4. Scaled vector spaces 

4.1. Definition. An S -scale of Banach spaces is a decreasing family 
of Banach subspaces (E s ), s e]0, S[ such that the inclusions 

E s+a C E s , s e]0,3[, a 6]0,5-s[ 

have norm at most 1. 

Let E be a topological vector space. An S-scaling of E is a scale 
(E s ) of Banach subspaces of E such that 

i) E = \J se]0>s[ E s ; 

ii) the direct limit topology induced by the inclusions E s C E coincide 
with that E. 

(Recall that a family f s : X s — > X of mappings between topological 
spaces X s to a set X induces a direct limit topology on X. It is defined 
as the finest topology for which the f s are continuous : 

U C X is open f^iU) is open in X s , for all s.) 

The interval ]0, S[ is called the scaling interval. 

According to the context, point ii) might be a requirement or the 
definition of the topology on E. 

If F is a closed subspace of a scaled vector space E then E/F is 
scaled by the Banach spaces : 

(E/F) s := Eg/ (E n F) a . 
The topological tensor product of scaled vector spaces E, F is scaled 

by 

(E®F) S := E S ®F S . 
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(Recall that the topological tensor product of two Banach spaces is the 
completion of the tensor product for the norm 

|| ® Vi\\ < inf 2J 

where the inferior limit is taken over all different ways of writing the 
vector as a sum [T3J 155].) 

An important property of scaled vector space E is that they admit 
a natural filtration 

E := D D £ (2) D • • • 

by the subvector spaces 

E {k) = {xeE:3C, r, \x\ s < Cs k , Vs < r}, fc > 

Definition 4.1. T7ie order 0/ an element x £ E, denoted by ord(x) is 
the biggest k such that x £ E^ k \ 

4.2. Morphisms of a scaled vector space. Let E, F be two S'-scaled 
vector spaces. 

A linear mapping from u : E — > F is a morphism if for any s £]0, S*[, 
there exists s' £]0, 5"[ such that u defines by restriction a continuous 
mapping from E s to F s i. This defines the category of scaled vector 
spaces. 

We denote by £ F) the vector space of morphisms from E to F 
and when E = F, we simply write for E). 

The space F) is endowed with the strong topology. In particu- 
lar, if (u n ) C £(E, F) satisfies the following property : "for all s £]0, S[, 
there exists s' e]0,S[ such that the restriction of the sequence to E s 
defines a sequence in £(E s ,F s r) which converges to the restriction of 
u" then (u n ) is convergent. 

4.3. r-bounded morphisms. Let us keep the notations of the pre- 
ceding subsection. 

Definition 4.2 ([8]). A k-bounded r -morphism (r < S) u £ L(E,F) 
of S-scaled vector spaces is a morphism for which there exists a real 
number C > such that : 
C 

\u(x)\ s < —r\x\ s+a , for any s£]0,r[, a e]0,r - s], x £ E s+a . 
o~ 

Note that the condition can be re-written, in a multiplicative way, 
as follows : 

C 

\u{x)\\ s < 7^ _ xjkgk \ x \*> for an y s e ]°' r ]' ^ ^]0, 1 [, x £ E s+a . 

In view of our applications, for k = 0, we will assume that the condition 
holds also for a = 0, so that u maps F s to F s . This assumption is 
unessential but it simplifies some of the notations. 
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A morphism is called k-bounded (resp. bounded) if there exists r 
(resp. exist r and k) for which it is a fc-bounded T-morphism. 

If u : E — > F is fc-bounded then u(E^ +k ^) C F^\ Indeed, putting 
in the definition of a bounded morphism a = s we get that 

\u(x)\ s < ^\x\ 2s - 

We denote by N^(u) the smallest constant C which satisfy the estimate 
in Definition 14. 2 [ it defines a norm for the space "B^E, F) of fc-bounded 
r-morphisms. 

We denote by S fc (£',F) the vector space of fc-bounded morphisms 
between E and F. One easily checks that the normed vector subspaces 
F), N*), re]0,S[ define an S-scaling of B k {E, F). 

Definition 4.3. A closed vector subspace F of a scaled vector space E 
is called k-direct (or simply direct) if it admits a complement G such 
that the projection on F along G is k-bounded with norm at most 1. 

A /c-bounded map whose square is equal to itself and with norm at 
most one will be called a k-projector. (The condition that the norm is 
at most one can be avoided but this would imply the introduction of 
additional constants in our arguments and will not be relevant for our 
applications.) 

4.4. Scaling the space of holomorphic function germs. We de- 
note by T(— , — ) the functor of global sections. For any open subset 
U C C n , the space of holomorphic functions in U is endowed with the 
topology of uniform convergence on compact subset of U. If the open 
subset U contains the origin, we get a restriction mapping 

r : F(U, OeO — ► Oj-o- 

Such mappings induce a direct limit topology on the local ring of holo- 
morphic function germs at the origin E := 0c n ,o- 

We now scale this vector space. Fix S > and s e]0, S[. We denote 
by E s the vector space of continuous functions in the polydisc 

D s := {z G C" : sup \zi\ < s} 

i=l,...,n 

which are holomorphic in its interior 

E s = T(mt(D s ),O C n)nC (D s ,C)- 

The C°-norm 

:= sup \f(z)\ 
zeD s 

endows E s of a Banach space structure and the sequence (E s ) defines 
a scaling of E [BJ [22] . 
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The Cauchy inequalities show that for this scaling, any order k dif- 
ferential operator on 0<c™,o defines a /^-bounded morphism. 

For the scaling (E s ), we have 

(Oe>,o) (fc) =M^ i0 . 

Consequently the nitrations induced by the scaling and by the power 
of the maximal ideal are identical. 

4.5. The L 2 -scalings. Another scaling (H s ) of 0c™,o is given by taking 
L 2 -functions instead of continuous ones : 

H s := r(int (D s ), C ») H L 2 (D S , C). 

It has a Hilbert space structure defined by the usual hermitian structure 

(f,g)^ I f(z)g(z)dv, 



JD S 

where dV is the euclidean volume on C n ~ M 2n . 
Now consider the identity mapping 

J : 0o,o — > 0c",o, / !->■ / 

and equip respectively the source and the target with the scalings (i? s ) 
and (H 8 ). 

As 

, - \ 1/2 

/ I/WIW) < sup \f(z)\ {V^s)\ 
the map J is 0-bounded and 

Proposition 4.1. The inverse of the map J is a 1-bounded morphism 
of norm at most equal to one. 

Proof. Denote by | • \ s the norm in H s . For fixed z G D s and a > 0, we 
put 

f(z + a) — a^er 7 , aj G C. 
i>o 

We have 

|/(z)| V = lool V < £ |a,| V^'+ 2 = / |/(z)| W. 
As ,2 + D a C -D S+(T , we get the estimate 



/ \f(z)\ 2 dV<[ \f(z)\*dV=\f\ 



S + CF' 



'z+D a 

This proves the proposition. □ 
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We now assume that the scaling interval ]0, S[ is such that S < 
The proposition shows that for the C°-scaling, the projection 
on a closed vector subspace is a 1-projector. 

Corollary 4.1. Let (f n ) be a sequence which converges to a limit f^, 
in 0<n™,o and 

(p : N — > N 

an increasing function. The maps g n obtained by truncating the Taylor 
series of f n at order <p(n) define a sequence which converges to foo. 

Proof. Equip the space of germs with the L 2 -scaling associated to the 
disks (D 8 ). To distinguish both scalings, we use the notations | • |f 
and | ■ |f° the corresponding norms. Obviously one has 

1*1? < IMf • 

Applying Proposition 14.11 we get that 

lo ic° < h Q \ L2 < hf \ L2 < hf ic° 

The function 

h-n fn 9ni 

is orthogonal to the polynomials of degree less than ip(n). Choose s 
such that f n G E s for all n's. We have 

lim \h n \f = 

n — >oo 



and, again by Proposition 14.14 we have the estimate 

i , i no 1 , , , r 2 

\h \° < —\h \ L 

\ ll n\s-cj — \""n\ 
a 



I s 



for any a < s. Consequently : 



lim =0 



n — >oo 



b n\ s - a 

This proves the corollary. □ 

Here is a first relation between bounded morphism and small denom- 
inators. 

Proposition 4.2. Let a 6 IR d be a (C,r)-diophantine vector and let 
us endow 0<c™,o with a L 2 -scaling (H s ) defined above. The Hadamard 
product 

g* : O c ™,o — ► 

with the function 

defines an (n + l)r-bounded morphism. 
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Proof. If / = Y,i> a i z% tnen 



As a is (C, r)-diophantine, we have 

CLi 



(a,i) 



Thus 



1. 2 Lj 2t I 1 2 



i/*^<^\/E 

Denote by L the linear operator (^i9 zi )(^2^ 2 ) . . . (z n d Zn ). The Cauchy 
inequalities imply that L is n-bounded for the associated C°-scaling. 
By Proposition I4.1[ it becomes an (n + l)-bounded for the L 2 -scaling. 
Moreover, 

£k| 2 ||i|H^< \ LT f\ 2 s- 

This shows that the Hadamard product with the function g defines an 
(n + l)r-bounded mapping and concludes the proof of the proposition. 

□ 

4.6. Cutt-off property. In the previous n, we introduced the L 2 - 
scalings. One can similarly define Z^-scalings. The limiting case p = 
+oo is of particular interest. Thus, we consider holomorphic functions 

/ = ^ai2 i er(iiitp a ) J 0cn) 

i 

such that the quantity 

\f\™ = sup Hs N , |i| := i t + i 2 H h i n 

i 

is finite. Note that the filtration associated to this scaling is the filtra- 
tion by the powers of the maximal ideal. 

Although elementary, the following proposition turns out to be fun- 
damental. 

Proposition 4.3. If f is of order N for the L°° scaling of 0<o,o 
associated to the polydiscs D s then 

N 

\f\T<\f\: 



I DO 
\S+(T 



s + a 



Proof. Put / = ^\i\> N aiZ 1 , we have 

1*1 / n \ N 



I ails'* 1 = loilfs + trV* 1 



s + a J ~ • \s + a 



□ 
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Now, we would like to obtain a similar result for the C° scaling of 
0c",o associated to the polydiscs (D s ), just as we did for projections in 
14.41 To do this, consider the identity morphism 

c n ,o 

where the source and the target are respectively equipped with the C° 
and L°° S-scalings (S < 1) that we denote respectively by (C°) and 

As the map K factors through the map J (see I4.5p . the morphism 
K is 0-bounded of norm at most equal to (a/tFt)™ 

Proposition 4.4. The inverse of K is n-bounded with norm at most 
equal to S n . 

Proof. For / G C s ° we have : 

i/(z)i < e k| s w = y: ^(s + at (^) M = {^-Jm, 

for any z G D s and a G]0, S — s[. □ 
This shows that for / G (C° +(T )( 2iV ) and s + a < 1 we have 

r-. AT 

l/l.<^l/U (-^ 

cr" \ S + cr 

Definition 4.4. A filtration J' of a scaled vector space E has the (C, k) 
cut-off property if 

2 N 

\f\s<^-\f\s + c 7 



\s + a 

for any s g]0, S[, a G]0, S - s[ and f G 3 rN (E s+r7 ). 

The vector space 0c n ,o with its C°-scaling and its natural filtration 
has a cut-off property. This property is hereditary : if E has the (C, k) 
cut off property then so does the scaled vector space E<§)F, for any 
scaled vector space with the grading 

3 n (E(g)F) = 1 n (E)®F. 

4.7. The exponential of a bounded morphism. If u,v are mor- 
phisms, respectively k and fc'-bounded, then the composed mapping uv 
is (k + /c')-bounded and we have the estimate 

N k+k '(uv) < 2 k+k> N k (u)N k \v). 

Indeed : 

\(uv)(x)\ s < N k (u)^\v(x)\ s+a/2 < N k (u)N k \v)^\x\ s+a 
for any x G E s+U . In slightly greater generality : 
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Proposition 4.5. The product of ki-bounded morphism Ui, i = 1, . . . , n, 
is a k-bounded morphism with k := Y^=i ^» an< ^ 

n 



N k T { Ul ---u n ) <n k JJjvJmv/ 



i=l 



Moreover if u\ = ■ ■ ■ = u n = u is 1-bounded then 

N n (u n ) 
r V '- <TNl(u) n . 
n\ 

Corollary 4.2. Let u be a 1-bounded r-morphism. If the estimate 
2>N] (u) < s holds for any s < r then the expansion 



j>0 J 



converges to a morphism in E and 

le u xlx <T (mKu)y = i 

i>0 V 7 1 (l-A)s 

/or any X e]0, 1 - ^^[, se]0,r] and x E E s . 

Definition 4.5. A 1-bounded r-morphism u is called exponentiable 
provided that for any s < r, the r-morphism u satisfies the estimate 
3N}(u) < s. 

Remark that two commuting 1-bounded exponentiable morphisms 
u, v e ^r(E) satisfy the equality 

e u+v = e u £ v_ 

Consider, for instance, the vector space of holomorphic function 
germs in one variable C{z} := 0c,o- Endow this vector space of the 
scaling defined in 14.41 In this case, the norm of a derivation 

ad z , a G E T C C{z} 

viewed as 1-bounded morphism, is simply the C°-norm of a. Therefore, 
exponentiable 1-bounded morphism satisfy a(0) = 0. This condition is 
however not sufficient but, as one may check, if a'(0) is also equal to 
then it is exponentiable. More generally, any 1-bounded map of order 
2 is exponentiable. 



4.8. Main theorem. We saw that the algorithm for constructing nor- 
mal forms involves an infinite composition of exponentials. The fol- 
lowing result will be a fundamental test for the convergence of such 
products. I do not know any similar result in the Hamilton-Sergeraert- 
Zehnder setting. 
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Theorem 4.1. Let E be a scaled vector space and (u n ) C ^>\{E) a 
sequence of exponentiable 1-bounded r-morphisms. If the inequality 



Ui) < s 



i>0 



holds then the sequence of morphisms (g n ) defined by 

g n ■= e ^e«n-l . . . e u 

converges to an invertible element in £>(E). 

Proof. To prove the theorem, we start with the 

Lemma 4.1. Let (u n ) be a sequence of exponentiable 1-bounded r- 
morphisms. For any s <r and any x G E s , we have the estimate 

1 

provided that A is such that 

3 

ma x T\ vT^ 1 ^) < L 

t<n [1 — \)s 

Proof. Denote by Cj jU C Z- 5 , the multi-indices I = with 
coordinates in {0, . . . , n}. We have the formula 

for any a G 1, I = iz, . . . , i n ). 

For / = G Cj <n , we denote by cr(J) the vector whose 

components are obtained by permuting those of / in such a way that 
cr(I) p > o-(I) p+1 . 

We put 

u[I] := u a{I)l u a{I)2 ■ ■ / G Cj, n . 

We expand g n and collect the terms in the following way : 

^ n ^ n n n 

9n = J2^ E ^W) = 1 + E^ + 2E M ' + E Yl 2^) + .... 

j>0^' l£Cj, n i=0 i=0 j=0 i=j+l 

Fix s and put 

^:=iV s 1 K)iV s 1 ( Mi2 )...iV s 1 ( Mj J. 
Proposition 14.51 implies the inequality : 



^m(u[i})<yf[Ni(u tp ) = y Zl 
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and therefore 



\u[I](x)\ Xs < (^J^-J ZI \ X \ S , VA g]0,1[. 



Put 

a = 



(i-xy 

we get the estimate 

\9nX\xs < I £V Zl \ \ X \° = 1_n^V" 

This proves the lemma. □ 
We may now conclude the proof. Fix s g]0, r]. As 

3^iV s 1 ( Ml )<s, 

i>0 

we may choose A g]0, 1[ such that 

3^iV s 1 (^)<(l-A) S . 

Let us denote by || • \\\ the operator norm in L(E s ,E Xs ). The pre- 
ceding lemma gives the estimate 

1 

'^ n " A - 1 - 3 V NHu)' 

Thus the sequence (g n ) defines by a uniformly bounded sequence in 
£(E S , E^xs). We now prove that this is a Cauchy sequence. 

Let /i g]0, 1[ be such that 

3 sup Nl s (ui) < (1 — n)Xs. 

i>0 

I assert that the sequence with elements \\g n — g n ~i\\x^ is convergent. 
To see it write 

9n ~ 9n-i = (e Un - Id)^ n _i 
where Id G £>(E) denotes the identity mapping. 
Expanding the exponential, we get that : 

for any y G E\ s . Take y = g n x, this gives the estimate 

ll(e .._ Id)9 „_ l|lv ,^%_^. 

< Xs 
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As n goes to infinity, the quantity N\ s {u n ) goes to zero therefore 

3C X 

Kx ^ '■= SU P 3jV i (u ) 

n>0 1 _ n _ -^Xs^l 
As 

is finite. We have proved the estimate 

lln a \\ <K NM 
WQn ~ 9n-l\\Xfi S J^X,fi • 

This shows that 

HSWp - 9n\\\ti < ^2 Wdn+i - Qn+i—l \ \\fJ- < | ^ A S 

i=l \i=l 

thus (g n ) defines a Cauchy sequence in the Banach space L(E S , E^). 

This shows that the sequence (g n ) converges to an element g G £{E). 
A similar proof shows that the sequence (h n ) defined by 

h n = e- Uo e~ Ul ■ ■ ■ e~ Un 

converges to an element h G £(E). For all n G N, we have : 

g n h n = h n g n = Id 

therefore gh = hg = Id . This finishes the proof of the theorem. □ 
We now turn to KAM theory in the setting of scaled vector spaces. 



5. Approximations of scaled vector spaces 



5.1. Tamed morphisms. Let S > and E := n^«' F :=Y\F n be 
products of 5-scaled vector spaces. We abuse notations by omitting the 
index n in the norms of the E^s and the F n 's. A morphism u : E — > F 
is a collection of morphisms of scaled vector spaces : 

u • E — y F 

The notions defined for scaled vector spaces extend naturally to se- 
quences. For instance, if the /c-bounded, we say that u is k- 
bounded with norm 

N k T (u) := (N*(u n )). 
Definition 5.1. A r -morphism 

u = (w n ) :E-^F 
is called k-tamed if the real sequence N k {u) is of exponential class 2. 

We denote by 'M k (E, F) the vector space of fc-tamed morphisms. 

Let us endow the vector space E := Oc^o with a C°-scaling described 
in 14.41 with S < 1. Next consider the sequence of scaled vector spaces 
(E n ) with E n = E for all n's. 
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Proposition 5.1. Let a G M. d be such that the sequence <j{a)~ 1 G M. d 
is of exponential A < 2. The Hadamard products 

On* '■ Oc d ,o >* Oc<2,0 

with the functions 

9n{z) ■= Y] T~\ z% 

define a l-tamed morphism whose norm is bounded from above by the 
sequence a(a)~ l . 

Proof. Let us endow Oc d ,o with a L 2 -scaling described in 14.51 If / = 
^ i>0 aiZ % then 



f*g n (z)= V ( 

(oi,i) 

\\i\\<2 n ,i^0 v ' 



i j 
Z . 



Thus 




\f*9n\* < -rrJ Yl \ a *\ 2 \ z % < -rr-\f\s- 



i\\<2 n ,i^0 



This shows that, for the L 2 scaling the map is 0-tamed with norm is 
bounded from above by the sequence a(a)" 1 . By Proposition l4.lt this 
concludes the proof. □ 

5.2. Approximations of scaled vector spaces. Let be a S- 

scaled vector space. 

Definition 5.2. An approximation of is a sequence of scaled vector 
spaces (E n ) together with sequences of O-bounded morphisms, called 
restriction morphisms : 



and 



such that 



Hiq — > Hi i — > h 2 — ; 



l) r o r n = r ; 

ii) the norm of r n , r is at most 1 for any n > ; 

(The assumptions that the restriction maps are O-bounded is rather 
unnatural and can be replaced by bounded but this would slightly 
complicate the notations and would not be used in our applications.) 

We sometimes abuse notations by omitting to specify (C, k) and 
avoiding the index n in the notation of the restriction map from E n to 
Eoo. We will also use the notation 

r k : Ei — > E k , i < k, 
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instead of r k o r k _i o---or r 

The limit of x = (xq, xi, . . . , x n , . . . ) G E is the limit of the sequence 
(rx n ) C Eoo, if it exists. 

The topological tensor product of two approximations is defined by 
taking the topological tensor product of each pair of scaled vector space 
in the approximation : 

{E®F) n>s := E n ^®F n ^ s . 

Definition 5.3. A (C, ^-approximation of E^ is an approximation 
by filtred scaled vector spaces (En,^) such that each (£^,3^) satisfies 
the (C, k) cut-off property. 

Definition 5.4. Let E^ be an S-scaled vector space. The approxima- 
tion defined by taking all E n 's equal to E and the identity mapping 
for restriction morphism is called the tautological approximation of the 
scaled vector space E. 

If we now endow 0c d ,o °f the canonical filtration induced by its scal- 
ing then the tautological approximation becomes a (C, /^-approximation 
with C = S d , K — d (see Proposition I4.4[) . 

5.3. Ultra-violet approximations. Given a closed subset K C C d , 
we denote by C\ the sheaf on K of complex-valued A;- Whitney differ- 
entiable functions [43]. 

Scaled vector spaces are associated to specific choices of neighbour- 
hoods. In the case of approximations, these neighbourhoods depend 
on an index neN, i.e., we consider a two-parameter family of closed 
neighbourhoods of a point a G C d : 

(K n>s ), neN,K n , s cC d 

depending both on an index n and on a parameter s g]0, S[. We denote 
by Koo^ their intersection over n G N and put 

We assume that 

for any s G]0, S[, a g]0, S - s[, n G N. 

Let us now spell out the set (K n ^ s ) that we will use. We fix a real 
positive decreasing sequence a = (a*.) and define the closed subset 

C(a) n := {a G R d : a{a) k > a k , VA; < n} 

so that the intersections of these sets is the arithmetic class associated 
to a in R d . 
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Definition 5.5. Let B s be the closed polydisk of radius s centred at 
a G C(a). The compacts sets 

K niS = C((l - s/2)a) n n B s 

are called the ultra-violet neighbourhoods of a G M. d associated to the 
sequence a. 

Our scaling are chosen so that z G K nyS the polydisk centred at z of 
radius a n a/2 is contained in K niS+a . 

For any k,n, the neighbourhoods (K n>s ), s g]0, S[ give a C fc -scaling 
of the vector space E n : = Oc d ,« : 

E n , s := T(int (K niS ), o cd ) n C k (K ntS , C) 

with the C fc -norm : 

|/| s fc := sup \d?f(z)\. 

zeK n , s ,\P\<k 

The "limiting" vector space C^- ma is also scaled by : 

Eoo, s ■= c fc (-^oo,s n B s , C). 

This approximation (E n ) is called the C k -ultra violet approximation 
at the point a G W 1 associated to the sequence a. 

In addition to C k ultra-violet approximations, there are also Gevrey 
ultra-violet approximations {G 3 K ), for any fixed j > 1. The Gevrey 
norms are defined using the C fc -norms defined above 



k 
s 



I 1 1 1 l s 

:= sup 



In general, the canonical filtration does not satisfy a cutt-off property 
and this is the reason for which we introduced the harmonic filtration 



We may now state a variant with parameters of Proposition 15.11 Let 
us endow the vector space 0<c d ,o with the C°-scaling described in 14.41 

Proposition 5.2. Let E be a C° ultra-violet approximation in C d as- 
sociated to a decreasing sequence a. If the sequence a -1 is of exponential 
class 2 then the Hadamard products 

g n * : E n ®(M 2 cdfl ) n — y E n ®(M 2 cdfi ) n 

with the functions 

g n (a,z):= V" T J —z l ,a i eC 

||«||<2",i^0 v ' 

define a 1-tamed morphism whose norm is bounded from above by the 
sequence a" 1 . 
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The proof is straightforward. 

5.4. L 2 approximations and truncations. Consider now a two pa- 
rameter family of relatively compact open subsets (U ntS ), decreasing 
inn 6 N and increasing in s ; and let us replace the C°-scalings by 
L 2 -scalings : 

K, s :=r([/„, s ,o ci )nL 2 (f/ niS ,c). 

with 

L 2 Koo := limL^tfoo^C), K^ s := pj U n;S . 

neN 

In general, the vectors z 1 , I G N d , are not orthogonal in E n>s , so we 
cannot apply directly the arguments of 14.51 for truncations, neverthe- 
less : 

Proposition 5.3. Let (f n ) be a summable sequence in E and ip : 
N — > N an increasing function. The maps g n obtained by truncating 
the Taylor series of f n at order if (n) define a summable sequence and 

n>0 n>0 

Proof. The proof the proposition is based on the following lemma. 

Lemma 5.1. Let Q C lR d be a relatively compact open subset. Let H C 
L 2 (Q) be the closure of the vector subspace generated by the monomials 
z 1 , I G N d . Then for any aiz 1 G H and any k > 0, we have 

I yj a-i^W < || aiz 1 ]] 
\I\<k JeN d 

Proof. Put 

f :=Y^ aiZ ^ 9-=Yl aiZ *- 
7eN d |/|<fc 

Fix e > 0, we may find disjoint open polydisks D\, . . . , D n in Q such 

that 

/ i 9 i 2 = E/ m 2 +- 

Denote by (Hi, ]\ ■ ||») C L 2 (Di) the closure of the vector subspace 
generated by the monomials z 1 , I G N d . Let q G -Dj be the center of 
the disk. The polynomials (z + Ci) 1 form an orthogonal basis of Hi. 
Write 

we get that 

9 '■= Yl a 'i( z + c iY 
\J\<k 
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therefore 

\g\ 2 = (EKHI^ + q) 7112 

Di \\I\<k 

Summing over all i's, we get that 

/ \g\ 2 < [ \f\ 2 + e 

Jn Jn 

for any e > 0. This proves the lemma. □ 

The lemma implies that (g n ) is summable. Now, choose s small 
enough so that r(f n — g n ) G L 2 (_ft' 00jS ), for any n 6 N. The function 
r(f n — g n ) belongs to the closure of the vector subspace generated by 
the monomials z 1 , I eN d and is orthogonal to the z 7 's with \I\ < <p(n) 
thus the limit of the sequence (r(/ n — <?„)) is equal to zero. This proves 
the proposition. □ 

We may now compare L 2 and C° ultra-violet approximations, in a 
way similar to what we did in 14. 51 for scaled vector spaces. Let us denote 
by E' = (E' n ) a C°-ultra violet approximation associated to compact 
subsets (K niS ) and let C/ n>s be the interior of K nyS . The identity mapping 
induces a 0-bounded morphism. 

J : E' — ► E, f H- /. 

Note that for 2; e -ft'n.s, the polydisk centred at z of radius a n a/2 is 
contained in i^„ )S+(J , thus Proposition 14.11 admits the following gener- 
alisation : 

Proposition 5.4. The inverse of the map J is a 1-bounded morphism 
of norm at most (2a" 1 ). 



Combined with Lemma [5TTT this proposition has the following corol- 
lary : 

Corollary 5.1. Let (f n ) be summable sequence in E' and if : N — > N 

an increasing function. Assume that there exists s, A, B such that for 
any n 

\fn\ s < Ae~ Bn 

If the sequence a -1 has exponential type less than B then the maps g n 
obtained by truncating the Taylor series of f n at order f{n) define a 
summable sequence. 

Proof. To distinguish the approximation E' and E, we use the notations 
I • If' 2 and I ■ the corresponding norms. Choose a < s, Proposition 15.41 
implies that 

I \C° ^ I \L 2 

\9n\s~a — \9ri\s • 
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By Lemma [5 ■![ we get that : 

\iJn\s _i \Jn\s ■ 

As a -1 has exponential type less than B and 

\f \ L2 <\f \ c ° 

\Jn\s — \Jn\s ) 

there exists A' such that 

In \ c ° < A'p- 3 " 

\9n\s-a ^ A e 

for any n. This proves the corollary. □ 

5.5. Regularity lemmas. We will now give the arguments that en- 
sure that the C°-convergence implies the C°° convergence and even the 
Gevrey convergence in the diophantine case. In the context of the clas- 
sical KAM theorem (and therefore of diophantine approximation), the 
first observation is due to Pschel and the second one to Popov [271 EH] • 

In this n, E k denotes the C k ultra-violet approximation associated 
to a sequence a and the polydiscs D s C W 1 of polyradius (s, s . . . , s) 
centred at a vector a G C(a). For the norm of / G E k s we omit the 
indices k, n and write simply |/| s . 

Lemma 5.2. Consider a j-bounded r-morphism u° = (u®, . . . , it°, . . . ) 
of E°. For any o > 0, u° induces a (r — a)-morphism u k of E k such 
that 

Proof. The Cauchy inequalities imply that for any x G E 

2 2k ki 2 j+2k k^ 
\u k (x)\ s ^ < — — l| u °(7r(a;)) U_ ct/2 < — Tjnr^K)k(a:)|. 

and obviously 

\tc{x)\ s < \x\ s 

This proves the lemma. □ 

In particular, if u is a fc-tamed r-morphism in E° then for any a < r, 
it defines a tamed morphism in E k . 

We denote be the Gevrey approximation of order j associated to 
C(a). 

Lemma 5.3. Consider a j-bounded r-morphism u = (ui, . . . , u n , . . . ) 

of E°. For any o > ; u induces a (r — o~)-morphism v of G J +2 such 
that 

Ni_ (v n ) <p n N 3 T {u n ) 

with 

Pn ■= SUp 



k 



k / (k\y+^+ k a k 
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Moreover, if a is a (C,u) -sequence and u is tamed then the map v is 
also tamed. 

Proof. The previous lemma gives the estimate 

Ni_ a (v n )<p n N j (u). 



By Lemma [1. 11 we can choose B < 2 such that the sequence 

is of exponential type B. This proves the lemma. □ 

5.6. Infinite products. We start by extending the criterion of The- 
orem 14.11 to approximations. Let E be an approximation of a scaled 
vector space E^. 

Theorem 5.1. Let u be an exponentiable 1-bounded r-morphism of E. 
Assume that there exists A g]0, 1[ such that the norm of u satisfies the 
estimate 

3j2N$(u n )<(l-\)s 

n>0 

for any s <t, then the sequence (r g n ) defined by 

9n = e Un r n ---r 2 e Ul r l e U0 
converges in L(E , E^) and maps E s to E ps with p = A(l — A). 



Proof. The proof is similar to that of Theorem 14. 1L We have 

3^iV s x K)<(l-A) S 

n>0 

and 

3 sup Nl(u n ) < 3 J2 N lM < A(l - X)s = ps. 

— n>0 

Denote by || • \\ P}n the operator norm in the vector space £(E Sj o, E ps>n ). 
Expanding the exponential, we get the estimate : 

(*) \\9n ~ Tngn-^n < K X ^^1 

AS 

with 

3C A 

Kx := SUP 3A ri , ) 

n>0 1 _ \ _ \ u n) 
As 

and 



„>0 (l-A); 



This proves that the sequence (rg n ), n > j defines a Cauchy sequence 
in the Banach space £(E Si0 , E pSt00 ). □ 
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We are now ready to re-formulate the KAM theorem in an abstract 
setting. 

6. The generalised KAM theorem 

6.1. Statement of the theorem. To state the theorem, we need some 
preliminary definitions. 

Definition 6.1. A k-right quasi-inverse of a morphism u : E — > F 
between two (C, n) -approximations is a mapping v : F — > E such that 

i) for any n eN,y E F n , y - u n v n (y) G J n {F n ) ; 
ii) for any n, the map Id — u n v n is a k-projector. 

Definition 6.2. A map (non-necessarily linear) f : E — > F between 
two sequences of S -scaled vector spaces E = (E n ),F = (F n ) is called 
l-tamed if the sequence 

p n := sup{a l ^ f ^ s : 8 G]0, S[, a g]0, S-s[,xe E n , s+ „} 



I S+cr 



is of exponential class 2. 

Any tamed morphism is of course a tamed map. 

Theorem 6.1. Let (E,^') be a (C, ^-approximation, g a vector sub- 
space ofJA 1 {EY 2 \ a G E and M a subapproximation of E such that 
a + M is e s -invariant. Let F be an m-direct subspace of M and consider 
the maps 

p(a) : g — > M/F, u i-> u(a + a), 

for a G F . Let 

j:F^M k (M/F,g) 

be an l-tamed map such that j(a) is a m-right quasi-inverse of p(a). 
For any x G a + MQ 2k+2l+m+3 \ there exists a tamed morphism u = (u n ) 
of E and an integer N such that 

i) the sequence g n := re Un r n e Un '^ . . . e U2 r2e Ul r\e u ° converges to an el- 
ement g G £(-Eo, Eoo) ; 

ii) g(x) = r(a)(modFoo) ; 

iii) for any A < 2, there exists rj > such that for any s sufficiently 
small, we have N^(u n ) < i]s 2 e~ An . 



By Theorem 15. 1[ point i) is a consequence of point iii). 

Before proving the theorem, let us now spell out the different objects 
entering into the theorem, in the particular cases of Theorem 13.11 and 
for the algorithm described in 13.81 
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6.2. Proof of Theorem GOl Let E be the tautological approximation 
on 0<c 2r \o with its C°-scaling (see \AA\ . We choose the filtration defined 
in 13.31 but the natural filtration can also be used in this case. The 
vector spaces E^s are all equal and any holomorphic function germ 
can be viewed as an element of E n , for any n. 

We take M to be M^ 2 « - The tautological approximation induces an 
approximation on the ideal J 2 , we put F = I 2 © C. The vector space 
g is the space of exact hamiltonian derivations. 

Finally, we take 

n 

a = H {2) = ^otiPiqi, 
i=i 

the maps p(a) n are 

/ ^ {f,HW + a}, a G I 2 . 

The inverse constructed in 13.31 depends on the first derivative of a so 
it is 1-tamed (I = 1). By Proposition 15.11 j(a) is a 1-tamed morphism 
with norm at most (a(a)), so that k = 1. By Proposition 14.11 an y 
vector subspace which is a direct-summand is 1-direct so m — 1 . 

The generalised KAM theorem implies the existence of a symplectic 
morphism g which transform + R to H^ 2 ' modulo I 2 for any R G 
M 8 

The first germ is viewed as an element of of E Q and the second one 
as an element in E^. This is of course a pure artefact since both 
spaces are equal. This holds only for R e M^n.o so one might think 
that this is not sufficient to conclude. However, the Birkhoff normal 
form procedure shows that under the action of symplectomorphisms 
the germ + R contains in its orbit, an element of the type : 

H {2) + R N mod/ 2 , R N G M^L 

for arbitrary N, therefore the assumption that R G M^ 2 ™,o is not re- 
strictive. 

6.3. Proof of Theorem 12.11 The notations are those of 13.81 In this 
case, E is a topological tensor product of two approximations : the 
C°-ultraviolet approximation associated to 

K n = {(A, p) G C 2 ™ : a + D c (p) + 2Q(X + p) G C(a) n } 

and a system of neighbourhoods of the frequency vector a, the tauto- 
logical approximation on 0c 2 ™,o = C{g,p}. 

An element of (E n ) s is a continuous function in a neighbourhood of 
the origin, holomorphic inside its domain, depending on the \,p,q,p 
for 

a + D c (p) + 2Q(A + p) G e((2 - s)a) n . 
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The approximation F is the induced approximation on (J 2 ©C{A, p}Y 5 \ 
M is E^ . We define g as the space of sequences u = (u , u n , . . . ) 

where «, is the sum of an exact hamiltonian derivation with a C{A}- 
linear derivation of C{A,/i} : 



{fi,-} + Vi, freE®, ^GDer c{A} (C{A,/i}) 



such that u is a tamed morphism. Note that the exponential of is 
the flow at time 1 of a hamiltonian vector field. 

In this context, a = Gq + Ro and the maps p(a) n are all defined by 
the same expression 



g n i y M n /F n , (f, v) i y {f, Go + a} + «(/) 
with a G (/ 2 ©C{A,/i}) n . 

As before, the inverse constructed in 13.81 depends on the first de- 
rivative of a so it is 1-tamed (I = 1). By Proposition \b.'2\ j(a) is a 
1-tamed morphism with norm at most (a (a)), so that k — 1 again and 
by Proposition 14. 1[ we have m = 1. 

According to the theorem for any R G M^ 2 „ , there exists a Poisson 
morphism g which transforms Gq + Rq to Go modulo I 2 , where the first 
germ is viewed as an element of of Eq and the second one as an element 
in Eoo. 

6.4. The KAM iteritation in the abstract setting. We come back 
to the iteration introduced in l3.3l and l3.8l The germs Gj (or Hi), B^, Ri 
are now denoted Oj, a«, 

Let 

7Ti : E — y F, ir 2 : E — > G 
be m-bounded projections on F and its complement G. 
For n = 0, we take 

a = a, = 6, Mo = jo(b), a = 7Ti(0 o -u {a )), 7 = 7r 2 (/3 -tt (ao))- 

The iteration depends on the choice of some appropriate integer N, 
that will be fixed later. We define 

(j) : N — > N 

by 

(j)(n) -- 

We define the quantities 



N for n <N 
n for n > N 



Pn+i = r^ n+1) (e~ Un (a n + /?„)_- a n+1 ); 
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We put 

= Ki((3n+i — u n+1 a n+1 ); 
ln+i = ^(Pn+i - u n+1 a n+ i). 

Once this is done, this shows that (rg n x) converges to a limit x' G 
a + F and therefore concludes the proof of the theorem. Indeed, these 
estimate give 

lim r7„ + lim ra n = 0. 

n — >+oo n — !-+oo 

Using the definition of j, we get that : 

K + oi n + 7„ = u n (a n ). 

Taking the image under r of both sides in this equality, we get by 
passing to the limit : 

lim r(b n ) = (modFoo). 

n — ^+oo 

Moreover 

On+i + K+i = r n+1 g n {a + b) 

therefore by taking again the image under r of both sides and passing 
to the limit, we get that : 

r(a) = g(a + b) (modi^). 

Q.E.D. 

6.5. Preliminary lemma. Our estimates are based on the decompo- 
sition (3 n+ i as 

Pn+l = ?>(n+l)(Ai + B n + C n ), 

with 

' A n := (e- u "(Id +u n ) -ld)a n ; 

(e~ Un - Id)a n ] 
e~ Un ln- 

The next lemma will give estimates for quantites A n , B n , C n . 

Lemma 6.1. Let E be a scaled vector space. For any 1-bounded r- 
morphism u, any s e]0, r[ which satisfies the condition 

_ 3iV» 1 
" (r-s)-2 

and any x G E T , we have the inequalities 

1) |(e- u (Id +u) -Id)x\ s <4\x\ T t 2 ; 

2) \(e- u -ld)x\ s < 2\x\ T t ; 

3) |e"x| s < 2\x\ T . 
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Proof. The expansion 

e -«(id +u) -id=£fe^(-i) 



fn + 2)! 



n+1 n+2 



gives the estimate 



n>0 ^ 

As 



n>0 v 7 n>0 v ' 



< 1 

r — s 

the expansion of the left-hand side of the estimate is equal to 

t 2 



n>0 

As 



1 <4, Vte[0,~], 



(1 -t) 2 - ' 2 J 
we get the estimate i). Point hi) follows from Corollary 14.21 and point 
ii) is left to the reader. □ 

6.6. Setting up the sequences. The proof rely on the appropriate 
choices for the quantities which enter the notions involved in approxi- 
mations of scaled vector spaces. For this, we use the indices k, k, I, m 
and the constant A defined in the statement of the theorem. 

There are five sequences sequences to be defined : 

On), {q ), (Pn)- 

The first one estimates the growth of the norms of the quasi-inverses, 
while the second one is used to compensate this growth. The third 
one indicates loss of regularity at each step, that is if we start with 
an element in E Sn then we end up with an element in E Sn ^ r7n . The 
fourth sequence estimates the remainder at each step, while the fifth 
estimates the error committed by taking the ultra-violet cut-off in the 
linearised equations. 

So first, we use the assumption that j is tamed to get a uniform 
estimate for the norms of the j(a). We define the sequence (p n ) by 
putting 



1 + \a\ s +a 

if this upper bound is at least equal to one, otherwise we take p n := 1. 
The infinite products 



ill 



In '■— Y\Pn+i ~ Pn+lPn+2Pn+3 ' ' ' 



i>0 
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define a decreasing sequence bounded by 1 and 

Qn+l = Pn+l^n- 

As (p n ) is of exponential type 2 there exists constants C, B < 2 such 
that 

Pn <C'e Bn . 

thus 

1 

We now define (a n ). First, we choose a e]0, 1[ such that 

2 l - a > max(A, 5) 
and then take s small enough so that our initial data satisfy : 



a > — p a-s 



i,beE ps , p: =l + ^-L 



2 C 

with ps < 1. (Recall that eto = a is our initial vector and /?o = b is the 
perturbation.) 

Up to multiplication of the norms by a constant, we may assume 
that : 

1 

I Ips _ ?2 



This estimate avoids complicated coefficients when applying Lemma I6TT1 
Consider the sequence a := (cr n ) defined by 



1 

a n := s. 

" 2 an 



As 

> max(A,S), 

for sufficiently small rj > the sequence e := (e n ) defined by 



satisfies the estimates 



2 . 1 / (7n+1 N2fc+2i+m+2 

for any s G R + . 

Define the sequence (s n ) by 

s n+l '■= S n — CT n S, Sq = pS 

The (C, k) cut-off property implies that for any 7 G !P n (E n ) Sn , we have 
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Note that for m = n, we have 



1 



2" 



1 ~ e 

2 an 



__o(l— cn)n 



so the sequence (% n ) decreases exponentially faster than both sequences 
(q n ) and (e n ). 

6.7. Proof of Theorem 16.11 Up to multiplication of the norms by a 
constant, we may assume that : 

1 

\a\ps < — • 
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This estimate avoids complicated coefficients when applying Lemma IBTTl 

Up to changing s into a sufficiently small s' < s, we may assume 
that the following estimates hold for n = : 

i) \a„\s„-a n /3 < £ n ql ; 

ii) \Pn\sn < tnql ; 

ill) |7n| S „-2a„/3 < 4?n+l- 

Choose N big enough such that the sequence x '■= (Xn) defined by 

f 3 \ K ( 1 x 2 " (n) 



satisfies the estimate 
where 



N for n < N 
n for n > N 



Our task consists in showing the estimates i), ii) and iii) by induction. 
So assume that these are proved up to rank n. Using i) up to rank n, 
we get that 

n-l 
i=0 

Using the definition of the sequence (p„), we get the estimate : 

At-W3«..(X>)) < fi + E 1*0 s ir- 

i=o V i=0 / n n 

Thus u n = ]{Y^Zq \ot-i\si)fin satisfies the inequality 
i 3 k+l Pn 
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Combined with ii) , we get that : 



Qk+lp nk+l 



( \ at! ( \ ^ n 2 °n 

(*) N Sn _ 2an/3 (u n ) < —^rPnq n = —j^TQn+i- 

u n u n 

Point 1) of the lemma applied to A n and r — s = cr n /3 gives the 
estimate : 



14 1 < 



3 2 ^-K/aK) 2 



'■n\s n — a„ — o 



Using (*) at rank n, we get that : 

Ml < 2 f"„2 

\^n\s n -a n ^ 2 (fc+Z+l) y n+l" 



0"n 

By definition of the sequence (e n ) we get the estimate 

1 /<r n +i\ m 2 



14 1 <r i f^±I^ m 2 

— q I o j £ n+lQ n +l 



Point 2) of the lemma applied to B n and r — s = cr n /3 gives the 
estimate : 

Pn|s„-ff„ — <J | Q; ra|s n -2CT n /3- 

The induction hypothesis and (*) at rank n then give : 

3 k+l+1 et 



\B, 



?1 



By definition of the sequence (g„), we have 
thus 

IR I < 1 ( a n+l\ m ^ n 2 

\ JD n\s n -cr„ g ^ g J fc n+iy n +l- 



Point 3) of the lemma applied to C n and r — s = a n /3 gives the 
estimate : 



\a 



n\s„-<r„ ^ ^g^gy £ rt+iy., 



.2 

n+1- 



Combining these three estimates for A„, _B„ and C„, we get that 

I n I . /^n+A m 2 

|Pn+l|s n+ i S ^ g j £ n+lQn+V 

and in particular 

Sn+l 

This proves ii) at rank 

As n i is an m-projector and a n+ i = 7Ti(/3 n+ i), we get that 

\ a n+l\s n+1 -a n+1 /3 < £ n+l? n +l- 
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Let us now prove iii) using the estimate 

I n I . f°n+l\ m 2 

|Pn+l|s n+ i S ^ g J £ n+lQ n +V 

As 7„ + i G 3 r ^ n+1 )(E' n+ i), we have by the cut-off property : 

|7n+l|s n +i— 2<7 n +i/3 ^ Xn+1 \ln+l |s n +i — a n +i/3- 

As 7„ + i is obtained from f3 n+ i by an m-projector, we get that : 

/ 3 \ m 

|7n+l|s n+ i-o-„ + i/3 ^ I J Xn+l \fin+l \s n +i • 

Using the estimate obtained above for the norm of /3„+i, it implies 
that : 

II 2 2 2 

|7n+l|s„+i-2a„ + i/3 < Xn+l^n+lQn+1 < £ n+l ( ln+2- 

This proves iii) at rank n + 1 and concludes the proof of the theorem. 
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